SEQUENCE LISTING - WEE virus strain 71V-1658 - SEQ ID NO 1



LOCUS       AF214040   11484 bp    RNA             VRL       27-JAN-2000
DEFINITION  Western equine encephalomyelitis virus strain 71V-1658
            nonstructural polyprotein and structural polyprotein genes,
            complete cds.
ACCESSION   AF214040 AF143811
VERSION     AF214040.1  GI:6760410
KEYWORDS
SOURCE      Western equine encephalomyelitis virus.
  ORGANISM  Western equine encephalomyelitis virus
            Viruses; ssRNA positive-strand viruses, no DNA stage;
            Togaviridae; Alphavirus.
REFERENCE   1  (bases 1 to 11484)
  AUTHORS   Netolitzky,D.J., Schmaltz,F.L., Parker,M.D., Rayner,G.A.,
            Fisher,G.R., Trent,D.W., Bader,D.E. and Nagata,L.P.
  TITLE     Complete genomic RNA sequence of western equine
            encephalitis virus and expression of the structural genes
  JOURNAL   J. Gen. Virol. 81 Pt 1, 151-159 (2000)
  MEDLINE   20109043
REFERENCE   2  (bases 1 to 11484)
  AUTHORS   Netolitzky,D.J., Schmaltz,F.L., Parker,M.D., Rayner,G.A.,
            Fisher,G.R., Trent,D.W., Bader,D.E. and Nagata,L.P.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-DEC-1999) Medical Countermeasures Section,
            Defence Research Establishment Suffield, P.O. Box 4000, Stn Main,
            Medicine Hat, Alberta T1A 8K6, Canada
FEATURES             Location/Qualifiers
     source          1..11484
                     /organism="Western equine encephalomyelitis virus"
                     /strain="71V-1658"
                     /db_xref="taxon:11039"
     5'UTR           <1..24
     misc_feature    1..11484
                     /note="sequence of this entry added to the sequence of
                     AF214039, strain CBA87, constitutes the Western equine
                     encephalomyelitis virus complete genome"



     CDS             25..7428
                     /note="encodes nsP1, nsP2, nsP3 and nsP4 proteins"
                     /codon_start=1
                     /transl_except=(pos:5581..5583,aa:OTHER)
                     /product="nonstructural polyprotein"
                     /protein_id="AAF28339.1"
                     /db_xref="GI:6760411"

                     /translation="MERIHVDLDADSPYVKSLQRTFPQFEIEARQVTDNDHANARAFS
                     HVATKLIESEVDRDQVILDIGSAPVRHAHSNHRYHCICPMISAEDPDRLQRYAERLKK
                     SDITDKNIASKAADLLEVMSTPDAETPSLCMHTDATCRYFGSVAVYQDVYAVHAPTSI
                     YHQALKGVRTIYWIGFDTTPFMYKNMAGSYPTYNTNWADERVLEARNIGLGNSDLQES
                     RLGKLSILRKKRLQPTNKIIFSVGSTIYTEDRSLLRSWHLPNVFHLKGKSNFTGRCGT
                     IVSCEGYVIKKITISPGLYGKVENLASTMHREGFLSCKVTDTLRGERVSFAVCTYVPA
                     TLCDQMTGILATDVSVDDAQKLLVGLNQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRW
                     AREHRADLDDEKELGVRERTLTMGCCWAFKTQKITSIYKKPGTQTIKKVPAVFDSFVI
                     PRLTSHGLDMGFRRRLKLLLEPTVKPAPAITMADVEHLRGLQQEAEEVAAAEEIREAL
                     PPLLPEIEKETVEAEVDLIMQEAGAGSVETPRGHIRVTSYPGEEKIGSYAILSPQAVL
                     NSEKLACIHPLAEQVLVMTHKGRAGRYKVEPYHGKVIVPEGTAVPVQDFQALSESATI
                     VFNEREFVNRYLHHIAINGGALNTDEEYYKTVKTQDTDSEYVFDIDARKCVKREDAGP
                     LCLTGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVYGVPGSGKSGIIKSAVTKKDLVV
                     SAKKENCAEIIRDVRRMRRMDVAARTVDSVLLNGVKHPVNTLYIDEAFACHAGTLLAL
                     IAIVKPKKVVLCGDPKQCGFFNMMCLKVHFNHDICTEVYHKSISRRCTQTVTAIVSTL
                     FYDKRMKTVNPCADKIIIDTTGTTKPHKDDLILTCFRGWVKQLQIDYKNHEIMTAAAS
                     QGLTRKGVYAVRYKVNENPLYSQTSEHVNVLLTRTEKRIVWKTLAGDPWIKTLTAKYP
                     GDFTASLDDWQREHDAIMARVLDKPQTADVFQNKVNVCWAKALEPVLATANIVLTRQQ
                     WETLHPFKHDRAYSPEMALNFFCTRFFGVDLDSGLFSAPTVALTYRDQHWDNSPGKNM
                     YGLNREVAKELSRRYPCITKAVDTGRVADIRNNTIKDYSPTINVVPLNRRLPHSLIVD
                     HKGQGTTDHSGFLSKMKGKSVLVIGDPISIPGKKVESMGPLPTNTIRCDLDLGIPSHV
                     GKYDIIFVNVRTPYRNHHYQQCEDHAIHHSMLTCKAVHHLNTGGTCVAIGYGLADRAT
                     ENIITAVARSFRFTRVCQPKNTAENTEVLFVFFGKDNGNHTHDQDRLGVVLDNIYQGS
                     TRYEAGRAPAYRVIRGDISKSADQAIVNAANSKGQPGSGVCGALYRKWPAAFDRQPIA
                     VGTARLVKHEPLIIHAVGPNFSKMPEPEGDLKLAAAYMSIASIVNAERITKISVPLLS
                     TGIYSGGKDRVMQSLHHLFTAFDTTDADVTIYCLDKQWETRIIEAIHRKESVEILDDD
                     KPVDIDLVRVHPNSSLAGRPGYSVNEGKLYSYLEGTRFHQTAKDIAEIHAMWPNKSEA
                     NEQICLYILGESMSSIRSKCPVEESEASAPPHTLPCLCNYAMTAERVYRLRSAKKEQF
                     AVCSSFLLPKYRITGVQKLQCSKPVLFSGVVPPAVHPRKYAEIILETPPPPATTTVIC
                     EPTVPERIPSPVISRAPSAESLLSLGGVSFSSSATRSSTAWSDYDRRFVVTADVHQAN
                     TSTWSIPSAPGLDVQLPSDVTDSHWSIPSASGFEVRTPSVQDLTAECAKPRGLAEIMQ
                     DFNTAPFQFLSDYRPVPAPRRRPIPSPRSTASAPPVPKPRRTKYQQPPGVARAISEAE
                     LDEYIRQHSNXRYEAGAYIFSSETGQGHLQQKSVRQCKLQEPILDRAVHEKYYAPRLD
                     LEREKMLQKKLQLCASEGNRSRYQSRKVENMKAITAERLISGLGTYLSSEVNPVECYR
                     VNYPVPIYSSTVINRFTSAEVAVKTCNLVIQENYPTVASYCITDEYDAYLDMVDGASC
                     CLDTATFCPAKLRSYPKKHSYLQPEIRSAVPSPIQNTLQNVLAAATKRNCNVTQMREL
                     PVLDSAAFNVDCFKKYACNDEYWDTFRDNPIRLTTENVTQYVTKLKGPKAAALFANTH
                     NLKPLQEIPMDQFVMDLKRDVKVTPGTKHTEERPKVQVIQAADPLATAYLCGIHRELV
                     RRLNAVLLPNIHTLFDMSAEDFDAIIAEHFHHGDPVLETDIASFDKSEDDAIAISALM
                     ILEDLGVDQPLLDLIEAAFGNITSVHLPTGTRFKFGAMMKSGMFLTLFVNTLVNIMIA
                     SRVLRERLTTSACAASIGDDNIVHGVVSDTLMAERCATWLNMEVKIIDAVIGIKAPYF
                     CGGFILVDQITGTACRVADPLKRLFKLGKPLPVDDTQDCDRRRALHDEAMRWNRIGIT
                     DELVKAVESRYEIILAGLIITSLSTLAESVKNFKSIRGSPITLYG"

     CDS             7473..11183
                     /note="encodes nucleocapsid, E3, E2, 6K and E1 proteins"
                     /codon_start=1
                     /product="structural polyprotein"
                     /protein_id="AAF28340.1"
                     /db_xref="GI:6760412"

                     /translation="MFPYPQLNFPPVYPTNPMAYRDPNPPRRRWRPFRPPLAAQIEDL
                     RRSIVNLTFKQRSPNPPPGPPPKKKKSAPKPKPTQPKKKKQQAKRTKRKPKPGKRQRM
                     CMKLESDKTFPIMLNGQVNGYACVVGGRLMKPLHVEGKIDNEQLAAVKLKKASMYDLE
                     YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTVPRGVGGKGDSGRPILDNR
                     GRVVAIVLGGANEGTRTALSVVTWNQKGVTIRDTPEGSEPWSLVTALCVLSNVTFPCD
                     KPPVCYSLTPERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRSITDDFTLTSPYLGF
                     CPYCRHSTPCFSPIKIENVWDESDDGSIRIQVSAQFGYNQAGTADVTKFRYMSFDHDH
                     DIKEDSMEKIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASENSCTVEKKIR
                     RKFVGREEYLFPPVHGKLVKCHVYDHLKETSAGYITMHRPGPHAYKSYLEEASGEVYI
                     KPPSGKNVTYECKCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSPDLIRHT
                     DHSVQGKLHIPFRLTPTVCPVPLAHTPTVTKWFKGITLHLTAMRPTLLTTRKLGLRAD
                     ATAEWITGSTSRNFSVGREGLEYVWGNHEPVRVWAQESAPGDPHGWPHEIIIHYYHRH
                     PVYTVIVLCGVALAILVGTASSAACIAKARRDCLTPYALAPNATVPTALAVLCCIRPT
                     NAETFGETLNHLWFNNQPFLWAQLCIPLAALVILFRCFSCCMPFLLVAGVCLGKVDAF
                     EHATTVPNVPGIPYKALVERAGYAPLNLEITVVSSELTPSTNKEYVTCKFHTVIPSPQ
                     VKCCGSLECKASSKADYTCRVFGGVYPFMWGGAQCFCDSENTQLSEAYVEFAPDCTID
                     HAVALKVHTAALKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIAGPISAAFSPFDH
                     KVVIRKGLVYNYDFPEYGAMKPGAFGDIQASSLDATDIVARTDIRLLKPSVKNIHVPY
                     TQAVSGYEMWKNNSGRPLQETAPFGCKIEVEPLRASNCAYGHIPISIDIPDAAFVRSS
                     ESPTILEVSCTVADCIYSADFGGSLTLQYKADREGHCPVHSHSTTAVLKEATTHVTAV
                     GSITLHFSTSSPQANFIVSLCGKKSTCNAECKPPADHIIGEPHKVDQEFQAAVSKTSW
                     NWLLALFGGASSLIVVGLIVLVCSSMLINTRR"

     3'UTR           11184..11484



BASE COUNT 3251 a 2842 c 2808 g 2583 t 
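The feature coordinates and base composition listed for SEQ ID NO 1 can be cross-checked with simple arithmetic. The following is a minimal sketch only: the dictionary layout and the function names `span_length` and `check_listing` are illustrative assumptions, not part of the listing, and the numbers are copied from the FEATURES table and BASE COUNT line of this record.

```python
# Feature spans copied from the FEATURES table of SEQ ID NO 1
# (1-based, inclusive coordinates). The names are illustrative only.
FEATURES = {
    "5'UTR": (1, 24),
    "nonstructural CDS": (25, 7428),
    "structural CDS": (7473, 11183),
    "3'UTR": (11184, 11484),
}

# Values copied from the BASE COUNT and LOCUS lines of the record.
BASE_COUNT = {"a": 3251, "c": 2842, "g": 2808, "t": 2583}
LOCUS_LENGTH = 11484


def span_length(start, end):
    """Length of a 1-based inclusive span."""
    return end - start + 1


def check_listing():
    """Return the derived lengths and a few consistency flags."""
    lengths = {name: span_length(*span) for name, span in FEATURES.items()}
    # Untranslated gap between the two coding regions.
    intergenic = FEATURES["structural CDS"][0] - FEATURES["nonstructural CDS"][1] - 1
    return {
        "lengths": lengths,
        "intergenic": intergenic,
        "base_total": sum(BASE_COUNT.values()),
        "covers_locus": sum(lengths.values()) + intergenic == LOCUS_LENGTH,
        "cds_in_frame": all(
            lengths[name] % 3 == 0
            for name in ("nonstructural CDS", "structural CDS")
        ),
    }


if __name__ == "__main__":
    for key, value in check_listing().items():
        print(key, value)
```

Under these assumptions the base counts sum to 11484, the two coding regions are 7404 and 3711 bases long (both multiples of three), and the features plus the 44-base gap between the coding regions account for the full locus length.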

ORIGIN 

1 accctacaaa ctaatcgatc caatatggaa agaattcacg ttgacttaga tgctgacagc 
61 ccgtatgtca agtcgttaca gcggacgttt ccacaatttg agatcgaagc aaggcaggtc 
121 actgacaatg accatgccaa tgccagagcg ttttcgcatg tggcaacaaa gctcattgag 
181 agcgaagtcg accgggacca agttatcttg gacattggaa gtgcgcccgt cagacatgca 
241 cattccaatc accgctatca ttgtatctgc cctatgataa gcgctgaaga cccggacaga 
301 ctacaacggt atgcagaaag acttaagaaa agtgacatta ccgacaagaa catagcctct 
361 aaggcggcag acctgctgga agtcatgtca acaccagacg cagagactcc atctctgtgt 
421 atgcacacag acgccacgtg taggtacttt ggaagtgtag cagtatacca agatgtgtac 
481 gcagtccatg caccgacatc aatctaccac caggcgctta aaggagttag gacaatttac 
541 tggataggct ttgacacgac cccttttatg tacaaaaaca tggcaggttc ctaccctact 



601 tacaacacga actgggctga cgagagagta ttggaagcac gtaacattgg cctcggtaac
661 tcagatcttc aggagagcag gcttggaaaa ctctcaatcc ttaggaagaa gaggctccaa
721 cctactaata agatcatatt ctcggttggt tcaacaatct acacagaaga tagatcactg
781 ttacgtagct ggcatcttcc aaacgtgttc cacttgaaag gaaagtctaa cttcacaggt
841 agatgtggga ccattgtcag ctgtgaaggg tacgtcatca aaaagataac gatcagccca
901 ggactatacg gtaaagttga gaacttggcg tccacaatgc atcgcgaggg tttcttgagt
961 tgcaaagtca cagatacgct gcgcggcgag agggtttctt ttgctgtgtg tacgtatgta
1021 ccagccacac tttgcgatca gatgacaggg attctggcaa ctgacgttag tgtggatgac
1081 gcacaaaaac tattggttgg gctcaaccaa aggattgtcg tcaatggtag gacgcaaaga
1141 aatactaaca caatgcagaa ctatctatta ccagtggtcg cccaggcgtt ttccaggtgg
1201 gcgcgtgaac atcgtgccga cttggacgac gagaaagaac taggggtgcg ggagcgcact
1261 cttactatgg gctgctgctg ggctttcaag acccagaaaa tcacatccat ctacaagaag
1321 cctggtacgc aaacaattaa gaaagtacct gccgtctttg actcatttgt gattccacgc
1381 cttaccagcc acgggctcga tatgggcttc cgccgtaggc tcaagctgct gcttgaacca
1441 actgtcaaac ccgcaccggc tattacaatg gccgatgtgg agcatctgcg tggcttacag
1501 caagaagctg aagaagtggc tgcagcggaa gagatcagag aagccctgcc acccttgctc
1561 cctgaaatag aaaaagagac cgtagaggca gaagtagacc tcattatgca agaggcagga
1621 gcaggtagcg tggagacacc acgaggacac atcagggtga caagttaccc aggcgaagag
1681 aagattgggt cttacgctat actttcaccc caggcggtat tgaatagtga aaaactggcg
1741 tgtatccacc cattggcgga acaagtactg gtaatgactc acaaaggtag ggcagggaga
1801 tacaaagtcg agccatacca cggtaaggtc attgtaccag aagggacggc ggtccctgtt
1861 caagacttcc aggcattgag tgagagcgct acgatcgttt tcaacgagag ggagttcgta
1921 aacagatacc tgcaccacat cgcaatcaac ggaggagcgc taaacactga cgaagagtac
1981 tataagactg taaagactca ggacacagac tcagaatacg tcttcgatat tgacgcacga
2041 aagtgtgtta agcgagaaga cgcaggtccc ttgtgcctaa ccggtgatct ggtagatcca
2101 ccatttcacg agtttgcgta cgagagtctc aagacacgac cagcagcacc tcacaaagtc
2161 ccaaccatcg gagtctatgg agtgccaggt tcaggtaaat ctggaatcat caaaagcgct
2221 gtgactaaga aagatctggt tgtgagtgcg aagaaggaaa actgcgcaga aatcatcagg
2281 gatgtaagga ggatgagacg tatggatgtt gctgctagga ctgtcgattc agtgcttcta
2341 aatggggtta agcaccccgt taacactctg tacattgatg aggcatttgc ctgccatgca
2401 gggacgctgc tggcactgat tgccatcgtc aaacctaaga aagtggtatt gtgcggggac
2461 ccaaaacaat gcggcttctt taacatgatg tgcctgaaag tacattttaa ccatgacata
2521 tgcactgaag tgtaccataa aagcatctct aggaggtgca cacagactgt aaccgccatc
2581 gtctccacgc tcttctacga caagcgaatg aagacggtta acccatgtgc tgataaaatc
2641 atcatagata ccacagggac cacaaagccg cacaaagatg atctgattct aacctgtttc
2701 agaggatggg tgaaacagct acagattgac tacaaaaatc acgaaatcat gactgcggct
2761 gcatcgcaag gacttacgcg gaaaggcgtt tatgctgtca ggtacaaagt caacgagaat
2821 ccactctact cgcagacttc tgagcacgtg aacgtgttac ttacacgcac agaaaaacgc
2881 attgtctgga agacgctagc tggtgatccc tggataaaga cacttacagc taaatatccc
2941 ggggatttca cggcttcatt ggacgactgg cagcgcgaac acgacgccat tatggcacgc
3001 gttcttgata agccgcagac agctgatgtg ttccagaata aggtgaacgt ctgctgggcg
3061 aaggctttag agccagtctt ggccacggcc aacattgtgc tgacgagaca gcagtgggag
3121 acgttgcacc cattcaagca tgacagagcg tactcacctg aaatggcact gaacttcttt
3181 tgcaccaggt tctttggagt agacctggac agtgggttat tttccgctcc taccgtcgca
3241 cttacttaca gggatcagca ctgggataac tcgccaggga agaacatgta tgggcttaat
3301 agagaggtag caaaggagtt gtcacggcga tatccgtgca tcacaaaagc ggttgacaca
3361 ggcagggtag ctgatataag gaataatacc atcaaggact actctccaac aattaatgtg
3421 gttccattaa atcgccggtt gccccactcg ttgatcgttg accacaaagg acagggtaca
3481 actgatcaca gcggattcct atctaagatg aagggcaaat ctgtgttggt gatcggcgat
3541 cctatcagca ttccagggaa gaaagtagag tccatgggtc cattgcccac taataccatc



3601 aggtgtgatc tcgatttggg aatacctagc catgtcggta aatatgacat tatctttgtc
3661 aatgttagga ccccgtacag gaaccatcac taccaacagt gcgaggatca cgctatccac
3721 cacagcatgc taacgtgtaa ggctgtccac cacctgaaca ctggcggaac atgtgtggct
3781 atagggtatg ggcttgctga tcgcgcaacc gagaatatca tcactgcggt ggcacgctca
3841 tttaggttta cccgtgtctg tcagcctaag aacactgccg aaaatactga ggttctcttc
3901 gtgttcttcg gcaaggacaa cggcaaccac acacatgacc aggacagact cggtgtagtg
3961 cttgacaaca tctatcaagg gtcaaccagg tacgaggcag ggagagctcc agcgtacaga
4021 gtgatcagag gtgacattag caagagcgct gaccaagcta tcgttaatgc tgctaatagc
4081 aaaggtcaac caggttccgg agtgtgcggt gcactgtacc gaaaatggcc ggctgctttt
4141 gatagacagc caatagctgt cgggacggct agacttgtga agcacgaacc gctcatcata
4201 catgctgtag gacccaattt ttctaagatg ccggaaccgg agggcgacct taagctcgca
4261 gctgcctaca tgagcatagc gtccatcgtc aacgctgagc ggattacaaa aatatcagta
4321 ccgctactgt caaccggcat ctattctggt ggcaaagatc gagtgatgca atcattgcat
4381 cacctgttca ctgctttcga cactacggat gccgatgtca ccatatattg cttggataaa
4441 caatgggaga ccaggataat cgaggccatt caccgcaaag aaagcgtcga aattctggat
4501 gatgacaagc cagtagacat tgacttggtc agggtccacc caaacagctc tttggcaggc
4561 agaccaggtt actccgtcaa tgagggcaag ttgtattcat acctggaagg tacacgattc
4621 catcagaccg ccaaggacat tgccgaaatc catgcaatgt ggcccaacaa atctgaggct
4681 aatgagcaga tttgcttgta catcctgggg gagagtatgt ccagcatccg ctccaaatgc
4741 ccagtagagg agtcagaggc gtctgctcca cctcacacac ttccatgcct gtgtaattac
4801 gctatgacgg ctgagcgcgt atacaggttg cgctctgcga agaaagaaca gttcgccgta
4861 tgctcatcat tcctgttgcc gaagtacagg atcacaggcg tgcagaagct acagtgcagc
4921 aaaccagtcc tgttttcagg cgtcgtacca ccggctgtac accccaggaa gtacgcggaa
4981 ataattctag aaacgccacc accgccagca acgacaaccg taatatgtga accaactgtg
5041 ccagaacgta tacccagtcc ggtgatttct agagcaccaa gtgcggaatc actgctatcg
5101 cttggcggcg tctcgttctc tagctctgcc acacgctcgt caaccgcctg gagcgactat
5161 gacaggcggt ttgtggttac agctgatgtg catcaagcga acacgtctac gtggagcatc
5221 cctagtgctc ctggcttgga cgtccagctg ccttctgacg tcactgattc ccactggagt
5281 attccaagtg catcaggctt tgaagtgaga acaccatctg tacaggacct aactgcggag
5341 tgtgcgaagc ctcgtggact ggccgaaata atgcaagact tcaatactgc tcctttccag
5401 tttctttcgg actacagacc agtaccggca ccacggagac gccccatccc atcacctaga
5461 tcgacggctt ccgcacctcc agttccaaag ccacgcagga ctaagtacca acaaccacca
5521 ggagtcgcta gagcgatctc agaagcggag ttggacgagt acatccgtca acactccaac
5581 tgacggtatg aagcgggagc gtatattttc tcatcggaaa caggccaagg tcaccttcaa
5641 cagaaatcag tacgtcaatg taaactacaa gaacctatat tggatcgggc cgtccatgag
5701 aagtattacg ccccgcgcct cgatctcgaa agagagaaaa tgttacagaa gaaactgcaa
5761 ttatgcgcct ctgaaggaaa tagaagcagg tatcaatcac gaaaagtaga aaatatgaaa
5821 gcaattacag cggagcgact catttctgga ttgggcacat atctatcatc agaagtgaat
5881 cctgtcgagt gttacagagt caattatcct gtaccaatct actcgtcaac ggtaattaac
5941 aggtttacat ctgcagaggt cgcggttaaa acgtgcaact tagttatcca agagaattac
6001 cctacagtag ccagttattg tataacagat gaatacgatg cgtatcttga catggtggac
6061 ggcgcatcgt gctgtctaga tacagccact ttttgtccgg ctaaactgag aagctaccca
6121 aagaagcata gctatttgca gccagagata agatcagccg tcccatcgcc tatacagaat
6181 acattacaaa atgtattggc tgcagctact aaaaggaatt gcaacgttac ccaaatgcga
6241 gaattacctg tcttagattc ggcggcattt aatgttgatt gtttcaagaa atacgcatgc
6301 aatgatgagt actgggatac ctttcgcgat aaccctattc ggctaactac agagaacgtt
6361 acgcaatatg tgacaaagct gaaagggccg aaagcagcag cattgtttgc gaatactcat
6421 aatctaaaac cgttgcagga gataccaatg gatcaattcg tcatggatct aaagagagat
6481 gtcaaagtta ctcccggcac gaaacataca gaggagcggc ctaaggtgca ggttattcag
6541 gctgcagatc cccttgctac cgcttacctt tgcgggatcc atcgggaatt agtccgtaga






6601 ctgaatgcgg tgcttctgcc aaatatccat actctcttcg acatgtcagc ggaagatttt
6661 gatgcgatta ttgctgaaca tttccaccac ggcgacccag tattggaaac ggacatcgcg
6721 tcgtttgata aaagcgaaga cgacgctatc gccatttcgg cgttgatgat ccttgaggac
6781 ttaggtgtcg accaaccgct cttagatttg atagaggcgg cgttcggcaa tatcacatct
6841 gtgcacctac ctacaggaac gaggtttaaa tttggtgcca tgatgaaatc cggtatgttc
6901 ttaacgctgt ttgtcaacac actagtcaat atcatgattg ctagcagagt actacgtgaa
6961 cggttaacca cgtcagcgtg cgcggcctct atcggcgacg ataacatagt gcatggtgtc
7021 gtctccgaca ccttgatggc ggagagatgc gccacttggc tgaacatgga agtaaaaatt
7081 attgatgcag ttattggtat caaagcaccc tacttctgtg ggggatttat cctggtggac
7141 cagataacag gcacagcctg cagagtcgca gaccctctaa aaaggctttt taagcttgga
7201 aaaccattgc cagtcgatga tacccaagac tgcgaccgcc gccgggcact gcatgatgaa
7261 gcaatgcgat ggaacagaat tggaattacg gacgagttag tgaaggccgt agaatccaga
7321 tacgagatca tactggcagg cctgatcatc acgtctctgt ccacgttagc cgaaagcgtt
7381 aagaacttca agagcataag agggagccca atcaccctct acggctgacc taaataggtg
7441 acgtagtaga cacgcaccta cccaccggca gaatgtttcc ataccctcag ctgaactttc
7501 caccagttta ccctacaaat ccgatggctt accgagatcc aaaccctcct aggcgccgct
7561 ggaggccgtt tcggcccccg ctggctgctc aaatcgaaga tcttaggagg tcgatagtca
7621 acttgacttt caaacaacga tcacctaatc cgccgccagg tccaccgcca aagaagaaga
7681 agagtgctcc taagccaaaa cctactcagc ctaaaaagaa gaagcagcaa gccaagagga
7741 cgaaacgcaa gcctaaacca gggaaacgac aacgtatgtg tatgaagttg gagtcggaca
7801 agacatttcc gatcatgctg aacggccaag tgaatggata tgcctgcgtt gtcggaggaa
7861 ggctgatgaa accactccac gttgaaggaa aaattgataa tgagcaatta gcggccgtga
7921 aattgaagaa ggctagcatg tacgacttgg agtacggcga cgttccccag aacatgaaat
7981 cagacacgct gcagtacacc agcgacaaac caccgggctt ctacaactgg caccacggcg
8041 cagtccagta tgagaatggg agatttaccg taccgagagg agtgggcggg aaaggcgaca
8101 gcggaagacc gatcctggac aacagaggca gagttgtggc tattgttcta ggaggtgcaa
8161 atgagggcac gcgtacggcg ctttcagtgg tcacttggaa ccagaaaggg gtgaccatta
8221 gggatacccc cgaaggttct gaaccgtggt cactagttac agcgctatgc gtgctttcga
8281 atgtcacgtt cccatgcgac aaaccacccg tgtgctattc actgacgcca gaacgaacac
8341 tcgacgtgct cgaagagaac gtcgacaatc caaattacga cacgctgctg gagaacgtct
8401 tgaaatgtcc atcacgccgg cccaaacgaa gcattaccga tgacttcaca ctgaccagtc
8461 cctacctggg gttctgcccg tattgcagac actcaacgcc gtgtttcagc ccaataaaaa
8521 ttgagaacgt gtgggacgaa tctgatgatg gatcgattag aatccaggtc tcggcacaat
8581 tcggctacaa tcaggcaggc actgcggatg tcaccaaatt ccgttacatg tctttcgacc
8641 acgaccatga catcaaggaa gacagtatgg agaaaatagc tatcagcaca tctggaccct
8701 gccgtcgtct tggccacaaa gggtacttcc tgttagctca atgtcctcca ggtgacagtg
8761 taaccgtcag tatcacgagc ggagcatctg agaattcatg caccgtggag aaaaagatca
8821 ggaggaagtt tgtcggtaga gaggagtact tgttcccacc cgtccatgga aagctggtaa
8881 agtgccacgt ttacgatcac ttgaaggaga cgtctgccgg gtacataacc atgcacaggc
8941 caggcccaca cgcgtataag tcctatctgg aggaagcgtc aggcgaagtg tacattaaac
9001 caccttctgg caagaacgtc acctacgaat gtaagtgtgg cgactacagc acaggtatcg
9061 tgagcacgcg aacgaagatg aacggctgca ctaaagcaaa acagtgcatt gcctacaaga
9121 gcgaccaaac gaaatgggtc ttcaactcgc cggatcttat taggcacaca gaccactcag
9181 tgcaaggtaa attgcacatt ccattccgct tgacaccgac agtctgcccg gttccgttag
9241 ctcacacgcc tacagtcacg aagtggttca aaggcatcac cctccacctg actgcaatgc
9301 gaccaacatt gctgacaacg agaaaattgg ggctgcgagc agacgcaaca gcagaatgga
9361 ttacagggtc tacatccagg aatttttctg tggggcgaga agggctggag tacgtatggg
9421 gtaaccatga accagtcaga gtctgggccc aggagtcggc accaggcgac ccacatggat
9481 ggccgcatga gatcatcatc cactattatc atcggcatcc agtctacact gtcattgtgc
9541 tgtgtggtgt cgctcttgct atcctggtag gcactgcatc atcagcagct tgcatcgcca



9601 aagcaagaag agactgcctg acgccatacg cgcttgcacc gaacgcaacg gtacccacag
9661 cattagcggt tttgtgctgc attcggccaa ccaacgctga aacatttgga gaaactttga 
9721 accatctgtg gtttaacaac caaccgtttc tctgggcaca gttgtgcatt cctctggcag 
9781 cgcttgttat tctgttccgc tgcttttcat gctgcatgcc ttttttattg gttgcaggcg 
9841 tctgcctggg gaaggtagac gccttcgaac atgcgaccac tgtgccaaat gttccgggga 
9901 tcccgtataa ggcgttggtc gaacgcgcag gttacgcgcc acttaacctg gagatcacgg 
9961 tcgtctcatc ggaattaaca ccttcaacta acaaggagta cgtgacctgc aaattccaca 
10021 cagtcattcc ttcaccacaa gttaaatgct gcgggtccct cgagtgcaag gcatcctcaa 
10081 aggcggatta cacatgccgc gtttttggcg gtgtgtaccc tttcatgtgg ggaggcgcac 
10141 aatgcttctg tgacagtgag aacacacaac tgagtgaggc gtacgtcgag ttcgctccag 
10201 actgcactat agatcacgca gtcgcactaa aagttcacac agctgctctg aaagtcggcc 
10261 tgcgtatagt atacggcaac accaccgcgc acctggatac gtttgtcaat ggcgtcacgc
10321 caggttcctc acgggacctg aaggtcatag cagggccgat atcagccgct ttttcaccct 
10381 ttgaccataa ggtcgtcatc agaaaggggc ttgtttacaa ctacgacttc cctgagtatg
10441 gagctatgaa accaggagcg ttcggcgata ttcaagcatc ctcgcttgat gctacagaca 
10501 tagtagcccg cactgacata cggctgctga agccttctgt caagaacatc cacgtcccct 
10561 acacccaagc agtatcaggg tatgaaatgt ggaagaacaa ctcaggacga cccctgcaag 
10621 aaacagcacc atttggatgt aaaattgaag tggagcctct gcgagcgtct aactgtgctt 
10681 acgggcacat ccctatctcg attgacatcc ctgatgcagc ttttgtgaga tcatcagaat 
10741 caccaacaat tttagaagtt agctgcacag tagcagactg catttattct gcagactttg 
10801 gtggttctct aacattacag tacaaagctg acagggaggg acattgtcca gttcactccc 
10861 actccacgac agctgttttg aaggaagcga ccacacatgt gactgccgta ggcagcataa 
10921 cactacattt tagcacatcg agcccacaag caaattttat agtttcgcta tgcggcaaga 
10981 agtccacctg caatgctgaa tgtaaaccac cggccgacca cataattgga gaaccacata 
11041 aagtcgacca agaattccag gcggcagttt ccaaaacatc ttggaactgg ctgcttgcac 
11101 tgtttggggg agcatcatcc ctcattgttg taggacttat agtgttggtc tgcagctcta 
11161 tgcttataaa cacacgtaga tgactgagcg cggacactga catagcggta aaactcgatg 
11221 tacttccgag gaagcgtggt gcataatgcc acgcgccgct tgacactaaa actcgatgta 
11281 tttccgagga agcacagtgc ataatgctgt gcagtgtcac attaatcgta tatcacacta 
11341 catattaaca acactatatc acttttatga gactcactat gggtctctaa tatacactac 
11401 acatatttta cttaaaaaca ctatacacac tttataaatt cttttataat ttttcttttg 
11461 tttttatttt gtttttaaaa tttc 
// 
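An ORIGIN block such as the one above can be turned back into a contiguous sequence by dropping the position index and the spaces between ten-base groups. The following is a minimal sketch only: the helper names `parse_origin` and `translate` are illustrative assumptions rather than part of the listing, and the codon table is the standard genetic code written out by hand. As a worked example it translates the first twelve codons of the nonstructural CDS (25..7428) from the first ORIGIN line of SEQ ID NO 1.

```python
import re

# Standard genetic code, with bases ordered t, c, a, g at each codon position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def parse_origin(lines):
    """Join ORIGIN lines into one lowercase sequence string.

    Position indexes, spaces and the terminating '//' are discarded.
    """
    chunks = []
    for line in lines:
        line = line.strip()
        if not line or line == "//":
            continue
        chunks.append(re.sub(r"[^a-z]", "", line.lower()))
    return "".join(chunks)


def translate(seq):
    """Translate complete codons; unknown codons become 'X'."""
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First ORIGIN line of SEQ ID NO 1, copied from the listing.
first_line = (
    "1 accctacaaa ctaatcgatc caatatggaa agaattcacg ttgacttaga tgctgacagc"
)
seq = parse_origin([first_line, "//"])
print(len(seq))               # 60
# The CDS starts at base 25 (1-based), i.e. index 24 (0-based).
print(translate(seq[24:60]))  # MERIHVDLDADS
```

The twelve residues agree with the start of the listed translation. A plain table lookup like this one returns a stop (`*`) for the tga codon at 5581..5583, which the record flags with `/transl_except` and shows as X, so that position needs special handling if the full polyprotein is to be reproduced.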



SEQUENCE LISTING - pcDWXH-7 - SEQ ID NO 2 



<210> SEQ ID NO 2 

<211> LENGTH: 4150 

<212> TYPE: DNA 

<213> ORGANISM: WESTERN EQUINE ENCEPHALITIS VIRUS 

<214> FEATURES: 
                     Location/Qualifiers
     source          1..4150
                     /organism="Western equine encephalomyelitis virus"
                     /strain="71V-1658"
     vector sequence 1-9
     5' SacI primer 9-20
     3' end of NS4 gene 16-114
     intragenic region 115-159
     CDS             160-3869
                     /note="encodes capsid, E3, E2, 6K and E1 proteins"
                     /codon_start=160
                     /product="structural polyprotein"
                     /translation="MFPYPQLNFPPVYPTNPMAYRDPNPPRRRWRPFRPPLAAQIEDL
                     RRSIVNLTFKQRSPNPPPGPPPKKKKSAPKPKPTQPKKKKQQAKRTKRKPKPGKRQRM
                     CMKLESDKTFPIMLNGQVNGYACVVGGRLMKPLHVEGKIDNEQLAAVKLKKASMYDLE
                     YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTVPRGVGGKGDSGRPILDNR
                     GRVVAIVLGGANEGTRTALSVVTWNQKGVTIRDTPEGSEPWSLVTALCVLSNVTFPCD
                     KPPVCYSLTPERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRSITDDFTLTSPYLGF
                     CPYCRHSTPCFSPIKIENVWDESDDGSIRIQVSAQFGYNQAGTADVTKFRYMSFDHDH
                     DIKEDSMEKIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASENSCTVEKKIR
                     RKFVGREEYLFPPVHGKLVKCHVYDHLKETSAGYITMHRPGPHAYKSYLEEASGEVYI
                     KPPSGKNVTYECKCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSPDLIRHT
                     DHSVQGKLHIPFRLTPTVCPVPLAHTPTVTKWFKGITLHLTAMRPTLLTTRKLGLRAD
                     ATAEWITGSTSRNFSVGREGLEYVWGNHEPVRVWAQESAPGDPHGWPHEIIIHYYHRH
                     PVYTVIVLCGVALAILVGTASSAACIAKARRDCLTPYALAPNATVPTALAVLCCIRPT
                     NAETFGETLNHLWFNNQPFLWAQLCIPLAALVILFRCFSCCMPFLLVAGVCLGKVDAF
                     EHATTVPNVPGIPYKALVERAGYAPLNLEITVVSSELTPSTNKEYVTCKFHTVIPSPQ
                     VKCCGSLECKASSKADYTCRVFGGVYPFMWGGAQCFCDSENTQLSEAYVEFAPDCTID
                     HAVALKVHTAALKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIAGPISAAFSPFDH
                     KVVIRKGLVYNYDFPEYGAMKPGAFGDIQASSLDATDIVARTDIRLLKPSVKNIHVPY
                     TQAVSGYEMWKNNSGRPLQETAPFGCKIEVEPLRASNCAYGHIPISIDIPDAAFVRSS
                     ESPTILEVSCTVADCIYSADFGGSLTLQYKADREGHCPVHSHSTTAVLKEATTHVTAV
                     GSITLHFSTSSPQANFIVSLCGKKSTCNAECKPPADHIIGEPHKVDQEFQAAVSKTSW
                     NWLLALFGGASSLIVVGLIVLVCSSMLINTRR"

3'UTR 3870-4150 

BASE COUNT
ORIGIN

1 ggccctctag agctcatact ggcaggcctg atcatcacgt ctctgtccac gttagccgaa 
61 agcgttaaga acttcaagag cataagaggg agcccaatca ccctctacgg ctgacctaaa 
121 taggtgacgt agtagacacg cacctaccca ccggcagaat gtttccatac cctcagctga 
181 actttccacc agtttaccct acaaatccga tggcttaccg agatccaaac cctcctaggc 
241 gccgctggag gccgtttcgg cccccgctgg ctgctcaaat cgaagatctt aggaggtcga 
301 tagtcaactt gactttcaaa caacgatcac ctaatccgcc gccaggtcca ccgccaaaga
361 agaagaagag tgctcctaag ccaaaaccta ctcagcctaa aaagaagaag cagcaagcca
421 agaggacgaa acgcaagcct aaaccaggga aacgacaacg tatgtgtatg aagttggagt
481 cggacaagac atttccgatc atgctgaacg gccaagtgaa tggatatgcc tgcgttgtcg
541 gaggaaggct gatgaaacca ctccacgttg aaggaaaaat tgataatgag caattagcgg 
601 ccgtgaaatt gaagaaggct agcatgtacg acttggagta cggcgacgtt ccccagaaca 
661 tgaaatcaga cacgctgcag tacaccagcg acaaaccacc gggcttctac aactggcacc 
721 acggcgcagt ccagtatgag aatgggagat ttaccgtacc gagaggagtg ggcgggaaag 
781 gcgacagcgg aagaccgatc ctggacaaca gaggcagagt tgtggctatt gttctaggag 
841 gtgcaaatga gggcacgcgt acggcgcttt cagtggtcac ttggaaccag aaaggggtga 
901 ccattaggga tacccccgaa ggttctgaac cgtggtcact agttacagcg ctatgcgtgc 
961 tttcgaatgt cacgttccca tgcgacaaac cacccgtgtg ctattcactg acgccagaac 
1021 gaacactcga cgtgctcgaa gagaacgtcg acaatccaaa ttacgacacg ctgctggaga 
1081 acgtcttgaa atgtccatca cgccggccca aacgaagcat taccgatgac ttcacactga 
1141 ccagtcccta cctggggttc tgcccgtatt gcagacactc aacgccgtgt ttcagcccaa 
1201 taaaaattga gaacgtgtgg gacgaatctg atgatggatc gattagaatc caggtctcgg
1261 cacaattcgg ctacaatcag gcaggcactg cggatgtcac caaattccgt tacatgtctt 
1321 tcgaccacga ccatgacatc aaggaagaca gtatggagaa aatagctatc agcacatctg 
1381 gaccctgccg tcgtcttggc cacaaagggt acttcctgtt agctcaatgt cctccaggtg 
1441 acagtgtaac cgtcagtatc acgagcggag catctgagaa ttcatgcacc gtggagaaaa 
1501 agatcaggag gaagtttgtc ggtagagagg agtacttgtt cccacccgtc catggaaagc 
1561 tggtaaagtg ccacgtttac gatcacttga aggagacgtc tgccgggtac ataaccatgc 
1621 acaggccagg cccacacgcg tataagtcct atctggagga agcgtcaggc gaagtgtaca 
1681 ttaaaccacc ttctggcaag aacgtcacct acgaatgtaa gtgtggcgac tacagcacag 
1741 gtatcgtgag cacgcgaacg aagatgaacg gctgcactaa agcaaaacag tgcattgcct 
1801 acaagagcga ccaaacgaaa tgggtcttca actcgccgga tcttattagg cacacagacc
1861 actcagtgca aggtaaattg cacattccat tccgcttgac accgacagtc tgcccggttc 
1921 cgttagctca cacgcctaca gtcacgaagt ggttcaaagg catcaccctc cacctgactg 



1981 caatgcgacc aacattgctg acaacgagaa aattggggct gcgagcagac gcaacagcag
2041 aatggattac agggtctaca tccaggaatt tttctgtggg gcgagaaggg ctggagtacg
2101 tatggggtaa ccatgaacca gtcagagtct gggcccagga gtcggcacca ggcgacccac
2161 atggatggcc gcatgagatc atcatccact attatcatcg gcatccagtc tacactgtca
2221 ttgtgctgtg tggtgtcgct cttgctatcc tggtaggcac tgcatcatca gcagcttgca
2281 tcgccaaagc aagaagagac tgcctgacgc catacgcgct tgcaccgaac gcaacggtac
2341 ccacagcatt agcggttttg tgctgcattc ggccaaccaa cgctgaaaca tttggagaaa
2401 ctttgaacca tctgtggttt aacaaccaac cgtttctctg ggcacagttg tgcattcctc
2461 tggcagcgct tgttattctg ttccgctgct tttcatgctg catgcctttt ttattggttg
2521 caggcgtctg cctggggaag gtagacgcct tcgaacatgc gaccactgtg ccaaatgttc
2581 cggggatccc gtataaggcg ttggtcgaac gcgcaggtta cgcgccactt aacctggaga
2641 tcacggtcgt ctcatcggaa ttaacacctt caactaacaa ggagtacgtg acctgcaaat
2701 tccacacagt cattccttca ccacaagtta aatgctgcgg gtccctcgag tgcaaggcat
2761 cctcaaaggc ggattacaca tgccgcgttt ttggcggtgt gtaccctttc atgtggggag
2821 gcgcacaatg cttctgtgac agtgagaaca cacaactgag tgaggcgtac gtcgagttcg
2881 ctccagactg cactatagat cacgcagtcg cactaaaagt tcacacagct gctctgaaag
2941 tcggcctgcg tatagtatac ggcaacacca ccgcgcacct ggatacgttt gtcaatggcg
3001 tcacgccagg ttcctcacgg gacctgaagg tcatagcagg gccgatatca gccgcttttt
3061 caccctttga ccataaggtc gtcatcagaa aggggcttgt ttacaactac gacttccctg
3121 agtatggagc tatgaaacca ggagcgttcg gcgatattca agcatcctcg cttgatgcta
3181 cagacatagt agcccgcact gacatacggc tgctgaagcc ttctgtcaag aacatccacg
3241 tcccctacac ccaagcagta tcagggtatg aaatgtggaa gaacaactca ggacgacccc
3301 tgcaagaaac agcaccattt ggatgtaaaa ttgaagtgga gcctctgcga gcgtctaact
3361 gtgcttacgg gcacatccct atctcgattg acatccctga tgcagctttt gtgagatcat
3421 cagaatcacc aacaatttta gaagttagct gcacagtagc agactgcatt tattctgcag
3481 actttggtgg ttctctaaca ttacagtaca aagctgacag ggagggacat tgtccagttc
3541 actcccactc cacgacagct gttttgaagg aagcgaccac acatgtgact gccgtaggca
3601 gcataacact acattttagc acatcgagcc cacaagcaaa ttttatagtt tcgctatgcg
3661 gcaagaagtc cacctgcaat gctgaatgta aaccaccggc cgaccacata attggagaac
3721 cacataaagt cgaccaagaa ttccaggcgg cagtttccaa aacatcttgg aactggctgc
3781 ttgcactgtt tgggggagca tcatccctca ttgttgtagg acttatagtg ttggtctgca
3841 gctctatgct tataaacaca cgtagatgac tgagcgcgga cactgacata gcggtaaaac
3901 tcgatgtact tccgaggaag cgtggtgcat aatgccacgc gccgcttgac actaaaactc
3961 gatgtatttc cgaggaagca cagtgcataa tgctgtgcag tgtcacatta atcgtatatc
4021 acactacata ttaacaacac tatatcactt ttatgagact cactatgggt ctctaatata
4081 cactacacat attttactta aaaacactat acacacttta taaattctct cataatttca
4141 ctttaggttt
//



SEQUENCE LISTING - pVHX-6 - SEQ ID NO 3 



<200> SEQUENCE CHARACTERISTICS 

<210> SEQ ID NO 3 

<211> LENGTH: 1260 

<212> TYPE: DNA 

<213> ORGANISM: EXAMPLE OF A CMV PROMOTER COUPLED TO THE pcDW-XH7 SEQUENCE

<220> FEATURE:

1-302 pVAX vector sequence 

302-401 CMV promoter 

411 CMV putative transcriptional start site 

444-463 T7 promoter 

464-509 pVAX multicloning region 

510 start of pcDW-HX45 sequence 



1 cattgacgtc aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg 
61 tatcatatgc caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat 
121 tatgcccagt acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc 
181 atcgctatta ccatggtgat gcggttttgg cagtacatca atgggcgtgg atagcggttt 
241 gactcacggg gatttccaag tctccacccc attgacgtca atgggagttt gttttggcac 
301 caaaatcaac gggactttcc aaaatgtcgt aacaactccg ccccattgac gcaaatgggc
361 ggtaggcgtg tacggtggga ggtctatata agcagagctc tctggctaac tagagaaccc
421 actgcttact ggcttatcga aattaatacg actcactata gggagaccca agctggctag 
481 cgtttaaact taagcttggt accgagctca tactggcagg cctgatcatc acgtctctgt 
541 ccacgttagc cgaaagcgtt aagaacttca agagcataag agggagccca atcaccctct 
601 acggctgacc taaataggtg acgtagtaga cacgcaccta cccaccgcca gaatgtttcc 
661 ataccctcag ctgaactttc caccagttta ccctacaaat ccgatggctt accgagatcc 
721 aaaccctcct aggcgccgct ggaggccgtt tcggcccccg ctggctgctc aaatcgaaga 
781 tcttaggagg tcgatagtca acttgacttt caaacaacga tcacctaatc cgccgccagg 
841 tccaccgcca aagaagaaga agagtgctcc taagccaaaa cctactcagc ctaaaaagaa 
901 gaagcagcaa gccaagagga cgaaacgcaa gcctaaacca gggaaacgac aacgtatgtg 
961 tatgaagttg gagtcggaca agacatttcc gatcatgctg aacggccaag tgaatggata 
1021 tgcctgcgtt gtcggaggaa ggctgatgaa accactccac gttgaaggaa aaattgataa 
1081 tgagcaatta gcggccgtga aattgaagaa ggctagcatg tacgacttgg agtacggcga 
1141 cgttccccag aacatgaaat cagacacgct gcagtacacc agcgacaaac caccgggctt 
1201 ctacaactgg caccacggcg cagtccagta tgagaatggg agatttaccg taccgagagg

.... CONTINUOUS WITH pcDW-HX45 (pcDW-XH7) SEQUENCE 

4177 gaatgtaaac caccggccga ccacataatt ggagaaccac ataaagtcga ccaagaattc 
4237 caggcggcag tttccaaaac atcttggaac tggctgcttg cactgtttgg gggagcatca
4297 tccctcattg ttgtaggact tatagtgttg gtctgcagct ctatgcttat aaacacacgt
4357 agatgactga gcgcggacac tgacatagcg gtaaaactcg atgtacttcc gaggaagcgt
4417 ggtgcataat gccacgcgaa gcttgacact aaaactcgat gtatttccga ggaagcacag 
4477 tgcataatgc tgtgcagtgt cacattaatc gtatatcaca ctacatatta acaacactat 
4537 atcactttta tgagactcac tatgggtctc taatatacac tacacatatt ttacttaaaa 
4597 acactataca cactttataa attcttttat aatttttctt ttgctttaga gcacactggc 



4657 ggccgttact agtggatccg agctctagag ggcc

4177-4643 pcDW-HX45 
4644-4790 vector sequence
//






